Parallel machine learning approaches for reverse engineering genome-scale networks
نویسنده
چکیده
Srinivas Aluru School of Computational Science and Engineering Georgia Institute of Technology Reverse engineering whole-genome networks from large-scale gene expression measurements and analyzing them to extract biologically valid hypotheses are important challenges in systems biology. While simpler models easily scale to large number of genes and gene expression datasets, more accurate models are compute intensive limiting their scale of applicability. In this talk, I will present our research on the development of parallel mutual information and Bayesian network based structure learning methods to eliminate such bottlenecks and facilitate genome-scale network inference. As a demonstration, we reconstructed genome-scale networks of the model plant Arabidopsis thaliana from 11,700 microarray experiments using l.57 million cores of the Tianhe-2 Supercomputer. Such networks can be used as a guide to predicting gene function and extracting context-specific subnetworks. Biography Srinivas Aluru is a professor in the School of Computational Science and Engineering at Georgia Institute of Technology. He co-directs the Georgia Tech Strategic Initiative in Data Engineering and Science, and co-leads the NSF South Big Data Regional Innovation Hub. Earlier, he held facuIty positions at Iowa State University, Indian Institute of Technology, New Mexico State University, and Syracuse University. Aluru conducts research in high performance computing, bioinformatics and systems biology, combinatorial scientific computing, and applied algorithms. He pioneered the development of parallel methods in computational biology, and contributed to the assembly and analysis of complex plant genomes. Aluru serves on the editorial boards of the IEEE Transactions on Big Data, IEEE Transactions on Parallel and Distributed Systems, Journal of Parallel and Distributed Computing, and the International Journal of Data Mining and Bioinformatics. He is currently serving as the Chair of the ACM Special Interest Group on Bioinformatics, Computational Biology and Biomedical Informatics (SIGBIO). Aluru is a recipient of the NSF Career award, IBM facuIty award, Swarnajayanti Fellowship from the Government of India, the mid-career and outstanding research achievement awards from Iowa State University, and the College of Computing Dean's award for facuIty excellence at Georgia Tech. He is a Fellow of the American Association for the Advancement of Science (AAAS) and the Institute of Electrical and Electronics Engineers (IEEE). 978-1-4673-6799-81l51$31.00m015 IEEE 3
منابع مشابه
De-Novo Learning of Genome-Scale Regulatory Networks in S. cerevisiae
De-novo reverse-engineering of genome-scale regulatory networks is a fundamental problem of biological and translational research. One of the major obstacles in developing and evaluating approaches for de-novo gene network reconstruction is the absence of high-quality genome-scale gold-standard networks of direct regulatory interactions. To establish a foundation for assessing the accuracy of d...
متن کاملMachine learning algorithms in air quality modeling
Modern studies in the field of environment science and engineering show that deterministic models struggle to capture the relationship between the concentration of atmospheric pollutants and their emission sources. The recent advances in statistical modeling based on machine learning approaches have emerged as solution to tackle these issues. It is a fact that, input variable type largely affec...
متن کاملA New Hybrid Meta-Heuristics Approach to Solve the Parallel Machine Scheduling Problem Considering Human Resiliency Engineering
This paper proposes a mixed integer programming model to solve a non-identical parallel machine (NIPM) scheduling with sequence-dependent set-up times and human resiliency engineering. The presented mathematical model is formulated to consider human factors including Learning, Teamwork and Awareness. Moreover, processing time of jobs are assumed to be non-deterministic and dependent to their st...
متن کاملProtein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches
DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...
متن کاملReverse engineering and analysis of large genome-scale gene networks
Reverse engineering the whole-genome networks of complex multicellular organisms continues to remain a challenge. While simpler models easily scale to large number of genes and gene expression datasets, more accurate models are compute intensive limiting their scale of applicability. To enable fast and accurate reconstruction of large networks, we developed Tool for Inferring Network of Genes (...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015